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In applications of White's Density Matrix Renormaliza- 
tion Group (DMRG) algorithm, computation time is domi- 
nated by the diagonalisation of large sparse Hamiltonians by 
iterative diagonalisation algorithms, whose convergence can 
be decisively accelerated by the usage of good start vectors. 
In this paper I show how, using the Marshall sign rule, in 
a wide class of antiferromagnetic models the number of di- 
agonalization iterations can be reduced below 10, sometimes 
down to 2, accelerating the DMRG by an order of magni- 
tude. This acceleration, applicable during the growth of long 
chains, complements the acceleration procedure proposed by 
White. To illustrate the feasibility of the approach, I show 
how it performs if applied to the calculation of the Haldane 
gap for S — 2. 
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I. INTRODUCTION 

In recent years, the density matcix renormalization 
group algorithm proposed by WhiteS in 1992, has be- 
come together with exact diagonalisation and quan- 
tum Monte Carlo methods the algorithm of choice in 
low-dimensional quantum models, including magnetic, 
fermionic and bosonic systems. 

White has proposed two different DMRG algorithms, 
the so-called finite-size and infinite-size algorithm. The 
infinite-size algorithm precedes the application of the 
finite-size algorithm: a chain is grown symmetrically to 
its final size, whereas the finite-size algorithm treats the 
fully-sized system splitting it asymmetrically, which in- 
creases the precision of the results. Sometimes, it is how- 
ever advantageous to stick to the infinite-size algorithm 
and increase its precision by increasing the number of 
block states M. For example, when the Hamiltonian is 
invariant under reflection and parity thus a good quan- 
tum number, it is of great advantage to retain this quan- 
tum number, for easier classification of states and for 
thinning out the Hilbert space by splitting it into more 
sectors invariant under the operations of the Hamilto- 
nian. This gives access to more states and speeds up the 
algorithm. 

In either case, DMRG precision is dominated by the 
number of block states M , and time consumption scales 
as M 3 . The most time-consuming part of the algorithm 
is the determination of low lying eigenstates of the ap- 
proximate Hamiltonians that are formed during the iter- 
ative application of the decimation process of the DMRG. 
To do this, iterative diagonalisation algorithms such as 



LanczosB'B are used. They can be accelerated if a good 
initial guess for the targeted state is available, as the 
number of (Lanczos) iterations drops. 

During the application of the finite size algorithm, a 
previous ground state for full-length system under con- 
sideration as well as the (incomplete) DMRG basis trans- 
formations are available. White has used this informa- 
tion to make a very good guess for the initial state of the 
diagonalization algorithrnu, basically carrying out basis 
transformations on the previous ground state. During 
the application of the infinite size algorithm, previous 
ground state(s) were all obtained on shorter chains, such 
that a basis transformation is not feasible. What I want 
to show in this paper is how in the infinite size algorithm 
the old information can be used to also make a good 
initial guess. 

What the DMRG essentially does is to find a fixpoint 
in density matrix space, i.e. for large systems, essentially 
the same (incomplete) transformation from a basis with 
MN states (M block states and N spin states Vio a new 
(decimated) basis with M states is carried outfi. A first 
guess would be that the target state converges to a fix- 
point in Hilbert space also, implying that simply using 
the target state found in the last iteration as the initial 
state vector should be a good guess. 

This is not so: the overlap between the initial guess and 
the result is typically far from 1, and (fi|7i|t>i), the en- 
ergy expectation value of the initial guess is much larger 
than the true target state energy. One finds that for 
longer chains, the absolute values of wave function coef- 
ficients (if labeled suitably, see below) converge fast to 
fixed values, while the signs vary randomly. If one could 
predict the sign changes, the old wave function would 
indeed provide an excellent starting point. The random- 
ness in signs has two origins. First, real eigenvectors are 
only determined up to a global sign, which is attributed 
unpredictably by the density matrix diagonalization algo- 
rithm. Second, there are deterministic sign changes with 
chain length. We will see how the latter can be used to 
fix the global signs of eigenvectors such that almost all 
signs of the new wave function are correctly reproduced 
in the old wave function, which then can serve as excel- 
lent prediction for the new wave function. 



II. PREDICTION MECHANISM 

Let us consider, to simplify the description, an 
isotropic Heisenberg antiferromagnetic spin chain with 
integer spin length S. Let us assume that we have 
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reached a certain block length L, such that the total 
chain under consideration has length 2L + 2. Let us call 
the block states of the block of length L \m' L ), those of 
the block of length L + 1 m^+i), with total magnetisa- 
tions S*i and St., , . If we call a the spin state on site 
L + l, and the magnetisation S*, the DMRG decimation 
procedure yields 



(m' L a\m L+1 ) ^ 



with 



si 



(1) 



(2) 



as necessary condition. Both the states of the block with 
length L and the block with length L + 1 are now ordered 
and numbered in the respective magnetisation sectors ac- 
cording to their importance, i.e. the associated eigenvalue 
of the density matrix when they were formed. I assume 
now that the respective magnetisation sectors for the 
blocks of length L and L+l have equal numbers of states. 
This is normally the case as soon as chains exceed cer- 
tain, rather small lengths; if not, typically one or two 
states of extremely small weigth are redistributed. One 
can easily modify the proposed procedure to take this 
into account, which for reasons of clarity I will not dis- 
cuss; all results shown below were obtained by switching 
off the prediction mechanism in these rare circumstances. 
Otherwise the condition could be ensured by adding or 
dropping a state which has very little weight, at minor 
loss of precision. 

Fundamental rule. If this assumption holds, the global 
sign of the eigenvector of the density matrix giving 
|min) is chosen such that 



(-lf mL + 1 (m L a\m L+1 ) > 



(3) 



for the case where = m^+i, meaning that the states 
have equal label number (I will call them "equivalent"). 
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FIG. 1. DMRG chains for different block lengths L and 
L + l with sublattice structure. Sublattices are labeled such 
that central spins remain on the same sublattice. Blocks are 
represented by rectangles. Note the shift in sublattice position 
of block spins during system growth. 

How can thiSprule be motivated? Let us consider Mar- 
shall's sign ruleo. In its original form it states that for a 
bipartite connected (i.e. that only sites on different sub- 
lattices A and B interact and that each site can be linked 



to each other site by a chain of bonds) isotropic antifer- 
romagnet the lowest energy state \ip M ) in a sector with 
total magnetisation M can be written in the Ising basis 



,(-1) 



-S+Sf, \ 
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such that c„ > 0. The |n) are the Ising basis states, the 
sum in the exponent effectively yields the total magneti- 
sation of sublattice B. Marshall's sign rule applies to the 
two most interesting states for the DMRG, the ground 
and first excited states. Let us compare the two chains 
arising in the DMRG procedure shown in Figure 1, for 
block length L and block length L + l. If we assign the 
same sublattice positions to the center spins, the posi- 
tions of the spins in the L-block shift their position from 
one sublattice to the other. Let us now take the two (say, 
ground state) wave functions 

H>l) = ^rn' L *i*r m i \m l L a l a r m r L ) (5) 

= ^H+x^mE+x \ m L+\° l ° T m L+\l (6) 

If we make now the assumption that the DMRG is close 
enough to its fix point that we may identify both sys- 
tems, the wave function coefficients will be (almost) iden- 
tical but for the sign, as sublattice positions of spins have 
changed. This is confirmed by numerical evidence, and 
motivates calling states equivalent; while physically dis- 
tinct, they make (almost) the same contribution to (phys- 
ically also distinct) wave functions as also supported by 
the arguments of Ref. ^. It remains to determine the sign. 
If we consider two equivalent states and |m£ + i), the 
matrix element (mLO~\mL + i) is non-zero only if S* = 0. 
Expanding |tol) in the Ising basis, the signs of the wave 
function coefficients will, up to a global sign, be given by 
(— l)[ sz h ; where [S z ]^ is the sum over the magnetisation 
on sublattice B of Ising state n. If we identify now |mi) 
in \ipL) with |tol+i) in one sees that the added 

spin (sitting on sublattice ^4) makes no sign contribution, 
while the spins of the L-block have changed sublattice, 
such that the sign will be (— l)^ s '•». The relative sign 
change will be 

(7) 

The last expression is independent of the underlying Ising 
states and leads immediately to rule (||). 

Majority rule. Retracing the above argument, its cen- 
tral weakness is the assumption that block states can be 
identified for different lengths. Calculating the overlap 
between predicted wave functions and calculated wave 
functions, it is found to have increased to typically well 
above 0.9, but this does not yet lead to a large increase 
in performance of iterative diagonalizations. Thus, rule 
(Q) seems to catch an important point, but is also over- 
simplifying. Block identification implied that the added 
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spin had zero magnetisation, which is physically not true; 
there will be other magnetisations contributing, in partic- 
ular in excited states. Rule (||) may therefore not apply. 

To go beyond, magnetisations S* ^ have to be con- 
sidered also. It seems that there is no strict rule to do 
this - also because the overlaps between states may be so 
small that the sign is up to numerical arbitrariness - but 
we have found the following procedure to work extremely 
well. 

The expression 

(m L+1 ,m' L+1 ) := sign (m' L a\m L +i) (m L (-a)\m' L+1 ) 

(8) 

where \itil) and \itil+i), and \m' L ) and \m' L+1 ) are equiv- 
alent states respectively, is evaluated. If it is negative, 
the sign is considered "wrong". One finds, that almost 
all states \rriL+i) have all signs (m_L+i, m' L+1 ) right, and 
that some states (typically less than 10 out of several 
hundred) have almost all signs wrong. The wrong signs 
of the "almost correct" states Im^+i) are typically those 
{ m L+ii where \m' L+1 ) is an "almost wrong" state. 

By globally flipping the sign of the few "almost wrong" 
states, nearly all signs can be made right; the majority 
character of the rule makes it extremely stable against 
numerical imprecisions. The fact that only few flips are 
necessary justifies the assumptions made for (|J): As we 
will show, the majority rule added to (||) ensures that the 
old wave function has an almost perfect overlap with the 
desired new wave function. 

One of several ways to see how this rule goes beyond 
is to imagine that we are adding two sites to a block, 
assuming that rriL and can be identified, with spin 

—a on the first and spin a on the second site. There is no 
shift in sublattice positions, one of the new spins makes 
a sign contribution (— l) s °. Therefore, 

sign [(-1) S - (m L (-a)a\m L+2 )] = 1, (9) 

for identified states. Inserting Yl m ' l m L+i)( m L+il> wc 
find 

sign [{-l) s °^(m L {-a)\m L+1 )(m' L+1 a\rn L+2 )] = 1. 

m' 

Now we don't make a statement about (mi(- 
(this would not go beyond (||)), but about a product. 
Identifying the decimations L — > L + 1 and L + 1 — > L + 2 
in the above for large IB, we obtain the majority rule at 
least for the dominant overlaps. 

Extensions. So far, we have only considered isotropic 
chains. Retracing the derivation of the Marshall 
sign rule, it becomes evident that (antiferromagnetic) 
anisotropies and dimerized interactions are compatible 
with the sign rule; we found the prediction to worku. In 
other cases, such as frustration, where the Marshall sign 
rule does not apply, Richter, Ivanov and Retzlaff have 
shown that for small frustration the weight of the states 



whose signs violate the sign rule is zero or very smallo. 
In such cases, the method will still lead to an efficiency 
gain, which gradually disappears with increasing frustra- 
tion. This has been confirmed numerically. We have 
not considered half-integer spins, as state parity alter- 
nates during chain growth; this implies that states that 
are two DMRG steps apart will be related, making the 
method more complicated. However, the sign rule still 
holds, and it should be possible to adapt the prescription 
to that situation. 

The whole procedure can be reformulated to leave the 
, transformation matrix invariant, and use it and Mar- 
shall's sign rule to change the ground state wave func- 
tion. In any case, no new information is introduced: all 
procedures that just use the results of the last decima- 
tion inherently assume that one is already at the DMRG 
fixpoint, where (but for the sign) wave function coeffi- 
cients do not change. To go beyond, one has to evaluate 
this assumption, e.g. by studying whether the last two in- 
complete transformations ttil-i — * m-L and toj, — > m^+i 
are (loosely speaking) identical, leading eventually to ex- 
pressions similar in spirit to Ref. [|. To be able to com- 
pare meaningfully two independently computed transfor- 
mations, with random signs independently attributed by 
the diagonalization algorithms, a sign- fixing procedure 
as presented in this paper has to be invoked in any case, 
whereas in the finite-size algorithm the sign effects are 
already incorporated in the wave function. Results are 
not presented here, as the further improvement in con- 
vergence we obtained was, though systematic, rather dis- 
appointing. 

III. PERFORMANCE 

The above two rules can be implemented very easily 
(30 or so lines at most), and consume almost no mem- 
ory and computation time. The gain is however striking. 
To illustrate that the above procedure is useful, we cal- 
culated the S — 2 Haldane gap using chains with total 
length L to t — 600, while keeping M — 400 block states, 
employing magnetisation and parity as good quantum 
numbers. The chain is long enough, that the expected 
L~ 2 convergence of the finite length gap can be observed, 
and a gap A = 0.0907(2) extrapolated (cf. Refs. |). The 
number of Lanczos iterations without prediction varies 
between ~ 60 for the ground state and ~ 70 for the first 
excitation. As can be seen in Fig. |^, the prediction makes 
the number of Lanczos iterations drop dramatically, once 
the chain exceeds a certain length. For L > 200, more 
than 30 percent of all Lanczos runs finish after 4 or less 
iterations, where it has to be kept in mind that 2 is the 
minimum number of iterations to establish that a result 
has already converged. Less than 15 percent take 10 or 
more iterations, and the average is 7 iterations only. 
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S=2 ground state 
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chain length 



FIG. 2. Number of Lanczos iterations needed for the 
ground state of a S = 2 chain vs. length. Note that the min- 
imum number of Lanczos iterations to establish convergence 
is 2. 



ence between the energy expectation value for the reused 
old ground state and the converged Lanczos result. For 
L > 200, the guess error relative to the ground state en- 
ergy is always better than 10~ 6 , mostly at or below 10~ 10 . 
This shows that the prediction rules are very powerful in- 
deed. Remaining errors also come from the fact that the 
assumption that the absolute weight of states does not 
change with chain length is of course only approximate. 

S=2 first excitation 
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3. Decadic logarithm of the relative guess error 
— E 'guess) I E final for the ground state of a S = 2 



FIG. 

(E final 

chain vs. length. 

The intermittent peaks are due to rearrangements of 
the block states which reflect the growing chain length. 
In these iterations, prediction works only partially (the 
same phenomenon can be observed in strong form be- 
low L = 200). The number of Lanczos iterations is 
closely connected to the guess error (Fig. 0), the differ- 
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FIG. 4. Number of Lanczos iterations needed for the first 
excitation of a S — 2 chain vs. length. 
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5. Decadic logarithm of the relative guess error 
— E guess)/ E final for the first excitation of a S = 2 



FIG. 
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chain vs. length. 
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Results for the first excitation are not quite as good, 
as was to be expected, but still the number of Lanczos 
iterations drops to about 20, with some peaks up to 40. 
This is still a saving up to a factor 3. The relative error 
goes down to 10~ 9 , which shows that the method is also 
efficient here. 

Let us close with the remark that on an alpha worksta- 
tion, these calculations could all be done overnight and 
that S — 2 is not an "easy" case for demonstration pur- 
poses: the very long correlation length is damaging for 
the underlying assumptions of our procedure. 

IV. CONCLUSION 

We have shown how using information about the na- 
ture of antiferromagnetic wave function provided by the 
Marshall sign rule can be used to strongly reduce the 
number of diagonalization steps in the infinite system 
growth phase of the DMRG, allowing the growth of very 
long high-precision chains in reasonable time. While 
the proposed procedure is not as completely versatile as 
White's finite chain procedurefl, it covers many impor- 
tant scenarios and can be used as complementary to his 
procedure. 
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